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Abstract 

In this paper we study random partitions of (1,. ..,n} where every cluster of 
size j can be in any of Wj possible internal states. The Gibbs (n, fc, w ) distribution is 
obtained by sampling uniformly among such partitions with k clusters. We provide 
conditions on the weight sequence w allowing construction of a partition valued 
random process where at step k the state has the Gibbs ( n,k,w ) distribution, 
so the partition is subject to irreversible fragmentation as time evolves. For a 
particular one-parameter family of weight sequences Wj , the time-reversed process 
is the discrete Marcus-Lushnikov coalescent process with affine collision rate K i : j = 
a + b(i + j) for some real numbers a and b. Under further restrictions on a and 
6, the fragmentation process can be realized by conditioning a Galton-Watson tree 
with suitable offspring distribution to have n nodes, and cutting the edges of this 
tree by random sampling of edges without replacement, to partition the tree into a 
collection of subtrees. Suitable offspring distributions include the binomial, negative 
binomial and Poisson distributions. 
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1 Introduction 


Gibbs models for random partitions generated by random processes of coagulation and 
fragmentation have been widely studied ([18], [3S], [50] . [33]). They typically arise as 
equilibrium distributions of time-reversible processes of coagulation and fragmentation 
(see for instance m , and [5] for general results about exchangeable fragmentation- 
coalescence processes in equilibrium). There is a much smaller literature in which Gibbs 
models are derived from an irreversible Markovian coagulation process [28]. This paper 
presents a Gibbs model for an irreversible Markovian fragmentation process. While 
Gibbs models for physical processes of fragmentation have been treated before, such 
models typically allow the possibility of both fragmentation and coagulation at the 
microscopic level, resulting in a Gibbs equilibrium at any given time. However as 
time evolves, this equilibrium is moving towards a more fragmented state, so using the 
language of thermodynamics this equilibrium should be seen as a quasistatic equilibrium 
of an adiabatic process. The point here is to provide a rigorous Markovian model of 
irreversible fragmentation at the microscopic level with no possibility of coagulation 
allowed. 

The simplest way to describe the process treated here is to specify its time reversal. 
This is the Marcus-Lushnikov coalescent process with collision rate kernel K t J = a+b(i+ 
j ) for some constants a and b, where Kj j represents rate of collisions between clusters 
of i particles and clusters of j particles. This model was solved by Hendriks et al. [28], 
who showed that the distribution at time t in such a coalescent process started from a 
monodisperse initial condition is a mixture of microcanonical Gibbs distributions with 
mixing coefficients depending on t. Here, we derive what turns out to be essentially the 
same model, modulo time reversal and a formulation in discrete rather than continuous 
time, but from a different set of assumptions describing the evolution of the process with 
time running in the direction of fragmentation. The probabilistic link between the two 
sets of assumptions is a time reversal calculation using Bayes rule. The most interesting 
feature of this calculation is that starting from a natural recursive assumption for the 
fragmentation process in terms of Gibbs distributions, there is only one-parameter family 
of possible solutions to the problem, with the parameter corresponding to the ratio of the 
two parameters a and b in the collision rate kernel of the reversed time process. Another 
interesting feature is that for some but not all a and b, the fragmentation process for 
partitions of a set of size n can be realized by conditioning a Galton-Watson process with 
suitable offspring distribution to have a family tree of size n, then cutting the edges of 
this tree by a process of random sampling without replacement. The suitable offspring 
distributions include the binomial, negative binomial and Poisson distributions. 

1.1 Canonical and microcanonical Gibbs distribution 

Typically, the state of a coagulation/fragmentation process is represented by a random 
partition of n, that is a random variable with values in the set V n of all partitions of 
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n. In later sections of this paper the state of the process will be represented rather 
as a random partition of the set {1,2, ...,n}, as this device simplifies a number of 
calculations. But the rest of this introduction follows the more common convention of 
working with the set V n of partitions of the integer n. Let 

A = 1 C1 2 C2 • • • n Cn (1) 


denote a typical partition of n. Regarding the state of the system as a partition of 
n particles into clusters of various sizes, the state A in (jTD indicates that there are Cj 
clusters of size j for each 1 < j < n. Note that ^ ■ jcj = n, the total number of particles. 
The total number of clusters is k := yh Cj. The numbers c\,<>i,... ,Cj ... may be called 
numbers of monomers, dimers, ... j-mers, or numbers of singletons, doubletons, ... 
j-tons. The Gibbs model most commonly derived from equilibrium considerations is the 
canonical Gibbs distribution on partitions of n with weight sequence ( Wj ) defined by 


P{ A | n\w 1 ,w 2 , ■ ■ .,w n ) 


I n * (?)“ 

1=1 


( 2 ) 


where 

Y n = Y n (w 1 ,w 2 ,-..,w n ) (3) 

is a normalization constant. This polynomial in the first n weights W\,W 2 , ■ ■ ■ ,w n is 
known in the combinatorics literature as the complete Bell (or exponential ) polynomial 
[S]. In the physics literature the Gibbs formula @ is commonly written in terms of 
%i = Wi/il instead of Wi, and the polynomial 


Z n (xi,x 2 , • • .,w n ) := n\Y n (l\xi,2\x 2 , ■ ..n\x n ) 


(4) 


is called the canonical partition function. For textbook treatments of such models, and 
references to earlier work see [44] , Typically, the canonical Gibbs distribution (O is 
derived either from thermodynamic considerations, or from a set of detailed balance 
equations corresponding to a reversible equilibrium between processes of fragmentation 
and coagulation. In the latter case the canonical Gibbs distribution is represented as 
the equilibrium distribution of a time-reversible Markov chain with state space V n . 
For related models, see [2], and Vershik m, who also considers a variation of Gibbs 
distributions in the context of quantum statistical physics, where he derives asymptotics 
for the limiting shape of a Gibbs partition associated with the Bose-Einstein statistics by 
considering a certain variational problem. See also m where partitions are subject to 
a natural additional constraint of consistency corresponding to infinite exchangeability. 

Conditioning a canonical Gibbs distribution on the number of clusters k yields a 
corresponding microcanonical Gibbs distribution for each 1 < k < n. This distribution 
assigns to the partition A displayed in (|TJ) the probability 

I n i 

P(A| n,k;w 1 ,w 2 ,...,w n ) = (5) 

Bn,k Cj' V *! > 
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where B Uy k = B n ^{w\,w 2 ,...) is a partial Bell (or exponential) polynomial , and 

Z n ,k(x i,x 2 ,...,x n ) := n\B ntk (l\xi,2\x 2 ,... ,n\x n ) (6) 

is known as a microcanonical partition function. A great many expressions, represen¬ 
tations and recursions for these polynomials B n k and Z n ^ are known [S 03] . These 
formulae are useful whenever the weight sequence (wj) is such that the associated poly¬ 
nomials admit an explicit formula as functions of n and k, or can be suitably approx¬ 
imated (see e.g. ED]). Some of these results are reviewed in [13]. The class of weight 
sequences (wj) for which the microcanonical Gibbs model is “solvable”, meaning there 
is an explicit formula for the B n ^. is quite large. 

In the study of irreversible partition-valued processes, it has been found in several 
cases (discussed below) that the distribution of the process at time t is a probabilis¬ 
tic mixture over k of microcanonical Gibbs distributions, that is to say a probability 
distribution of the form 


n 

n» = £ Qn,kP{^\ n, k-,w 1 ,W 2 ,...,w n ) ( 7 ) 

k =1 

where q Uj k represents the probability that A has k components, so q n ^ > 0 and Ylk =l 9re,fc = 
1, and both q n ^ and the weight sequence Wi may be functions of t. We call any 
distribution of the form (0 a Gibbs distribution with weights ( Wj ), thereby including 
both canonical and microcanonical Gibbs distributions. For example, Lushnikov [39] 
showed that the coalescent model with monodisperse initial condition and collision rates 
K x ,y = xf(y ) + yf(x ) leads to such Gibbs distributions. Hendriks et. al [28] showed 
that this is also the case for K XtV = a + b(x + y) for constants a and b. See also [ IT 
Section 1.5] for an interpretation of Gibbs distributions with integer weights in terms 
of composite combinatorial structures. 

1.2 Organization and summary of the paper. 

The rest of this paper is organized as follows. Section [2] presents some background 
material, and introduces the formalism of Gibbs distributions over partitions of the set 
{1,2,..., n}. Results for irreversible fragmentation processes are presented in Section 
[3] Section [4] presents the main result of the paper. This result states that, under an 
additional set of assumptions (most notably, the linear selection rule), it is possible 
to construct a Gibbs fragmentation process with weight sequence (wj ) if and only if 
Wj = Uin=2 (mc + jb ) for some constants b and c: in this case it is shown that the time- 
reversal of the fragmentation process is the discrete Marcus-Lushnikov coalesent with 
affine coalescent rate: K, j = a + b(i + j). Section 5 provides some background material 
on generating functions and branching processes which is needed for the evaluation of a 
particular Bell polynomial. Section 6 then presents the proofs of the results of Section 
4. These results leave out an important case, which is that of the sequence Wj = (j — 1)!. 
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In section [71 we approach this problem from the angle of Kingman’s coalescent process 
and the Ewens sampling formula. In particular we construct a continuous analogue of 
the desired process. However, we show that the existence of this process with discrete 
time cannot be obtained by taking the discrete-time chain embedded in the continuous 
process, so that its existence remains an open question. 

2 Preliminaries 

Let [re] denote the set {1,... ,n}. A partition of [n] is an unordered collection of non¬ 
empty disjoint subsets of [re] whose union is [n]. A generic partition of [re] into k sets 
(sometimes also referred to as blocks or components of the partition) will be denoted 

7Tfc = {Ai ,..., Afc}, where 1 < k < n (8) 

and where blocks are numbered according e.g. to their least element. Let denote 
the set of all partitions of [n], and let V[ n ^] be the subset of V [„] comprising all partitions 
of [n] into k components. Given a sequence of weights ( Wj,j = 1, 2, ■ ■ ■) define the mi- 
crocanonical Gibbs distribution on V\ n _k\ with weights (w i,u> 2 , ■ ■ ■) to be the probability 
distribution on V^ n ^] which assigns to each partition as in <[8|) the probability 

Pn,Wl,W 2 ,...W n ) = — - (9) 

’ i=l 

where ffA^ denotes the number of elements of A j and 

k 

B n ,k := B ntk (wi, ...,w n ) := ^ ( 10 ) 

TTfcG^n.fc] i= 1 

where for it f. £ the n,(7Tj.) for 1 < i < k are the sizes of the components of 7in 

some arbitrary order. Throughout this paper we will use the notation p H) k{ ■; w\, ..., w n ), 
or simply p n ^ if no confusion is possible, for the microcanonical Gibbs distribution ([9]) 
determined by the sequence (w\,... ,w n ). Remark that as soon as k > 2, the mi¬ 
crocanonical Gibbs distribution p n j ; actually only depends on (ici,..., w n -\). Given a 
partition 7r of [n], the corresponding partition A of n is A := 1 C1 2 C2 • • • n Cn as in (pQ) where 
Cj is the number of components of 7r of size i. For each vector of non-negative integer 
counts (ci,... ,Cn) with 'fh i ic-i = n the number of partitions 7r of [re] corresponding to 
the partition 1 C1 2 C2 ■ • • n Cn of re is well known to be 


n \ / 

IRW‘ 

i =1 

The probability distribution on partitions of re induced by the microcanonical Gibbs 
distribution on Vy nt k\ with weights (w\,W 2 , ■ ■ •) therefore identical to the microcanonical 
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Gibbs distribution on V n with the same weights (w\,W 2 , ■ ■ •) as defined in (JSJ), and there 
is the following standard expression for B n ^ [9J: 


B 


^ =n! En 

Xv. i =1 


a\ 


( 12 ) 


where the sum is over all partitions A& of n into k components, and c, = cfi A*,) is the 
number of components of A& of size i. Thus transferring from Gibbs distributions on 
partitions of n into k components to Gibbs distributions on partitions of the set [n] into 
k components is just a matter of keeping track of the universal combinatorial factor 

m- 


2.1 Combinatorial Interpretation 

The following well-known interpretations provide both motivation and intuition for the 
study of Gibbs distributions and Bell polynomials. Suppose that n particles labelled 
by elements of the set [n] are partitioned into clusters in such a way that each particle 
belongs to a unique cluster. Formally, the collection of clusters is represented by a par¬ 
tition of [n]. Suppose further that each cluster of size j can be in any one of wj different 
internal states for some sequence of non-negative integers (wj). Let the configuration of 
the system of n particles be the partition of the set of n particles into clusters, together 
with the assignment of an internal state to each cluster. For each partition n of [n] with 
k components of sizes ..., n*,, there are n!=i ®n, different configurations with that 
partition n. So B n ^(w\ , W 2 , ...) defined by (flOl) gives the number of configurations with 
k clusters', the Gibbs distribution (J9|) with weight sequence (wfi is the distribution of 
the random partition of [ n] if all configurations with k clusters are equally likely, and 
formula ((H) describes the corresponding Gibbs distribution on partitions of n induced 
by the same hypothesis. 

Many particular choices of (wj) have natural interpretations, both combinatorial and 
physical. In particular, the following four examples have been extensively studied. Many 
more combinatorial examples are known where Gibbs distributions arise naturally from 
an assumption of equally likely outcomes on a suitable configuration space. Related 
problems of enumeration and asymptotic distributions have been extensively studied 

mmmmm- 

2.2 Some important examples 

We recall here some natural examples of Gibbs distributions for particular sequences 
of weights (wfi, and their combinatorial interpretations, which motivate our work in 
following sections. 

Example 1. Uniform distribution on partitions of [n]. Take Wj = 1 for all j. Then 
a configuration is just a partition of [n], so that B n ^( 1,1,...) is the number of parti¬ 
tions of [n] into k components, known as a Stirling number of the second kind. The 
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microcanonical Gibbs model p n p corresponds to assuming that all partitions of [re] into 
k components are equally likely. 

Example 2. Uniform distribution on permutations. Suppose that the internal state of 
a cluster C of size j is one of the (j — 1)! cyclic permutations of C. Then Wj = (j — 1)!, 
and each configuration corresponds to a permutation of [n]. Therefore B n ^{ 0!, l!, 2!...) 
is the number of permutations of [re] with k cycles, known as an unsigned Stirling number 
of the first kind. The microcanonical Gibbs distribution p n ^ is the distribution on V n 
induced by a permutation uniformly chosen among all permutations with k cycles. 

Example 3. Cutting a rooted random segment [44] . Suppose that the internal state of 
a cluster C of size j is one of j\ linear orderings of the set C. Identify each cluster as 
a directed graph in which there is a directed edge from a to b if and only if a is the 
immediate predecessor of b in the linear ordering. Call such a graph a rooted segment. 
Then B n ^{ l!, 2!, 3!...) is the number of directed graphs whose vertices are labelled by 



Figure 1: Cutting a rooted random segment. 

[re] with k such rooted segments as its components. In the previous two examples, 
explicit formulae for the B n % are fairly complicated. But this time there is a simple 
formula: 

B nM 1!, 21,3!...)=(”:])£ (13) 

is known as a Lah number [9^ p. 135]. The Gibbs model in this instance is a variation 
of Flory’s model for a linear polymerization process [20] . Another interpretation is 
provided by Kingman’s coalescent mm- It is easily shown in this case that a sequence 
of random partitions (life, 1 < k < re) such that IT^. has the microcanonical Gibbs 
distribution with k blocks, can be obtained as follows. Let G\ be a uniformly distributed 
random rooted segment labelled by [re], and let Gk be derived from G\ by deletion of a 
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set of k — l edges picked uniformly at random from the set of n—1 edges of G±, and let 11/,. 
be the partition induced by the components of Gk■ If the n — 1 edges of G\ are deleted 
sequentially, one by one, the random sequence (III, n 2 , ..., II n ) is a refining sequence of 
random partitions such that IR has the Gibbs microcanonical distribution (|9|). This is 
illustrated in figure [0 The time-reversed sequence (II n , n n _i, ... ,11!) is then governed 
by the rules of Kingman’s coalescent: conditionally given IR with k components, IR_i 
is equally likely to be any one of the (£) different partitions of [n] obtained by merging 
two of the components of IR. Equivalently, the sequence (III, II 2 ,... , IR) has uniform 
distribution over the set 7 Z n of all refining sequences of partitions of [n] such that the 
kth term of the sequence has k components. The consequent enumeration fflZ n = 
n!(n — l)!/2 n_1 was obtained by Erdos et al [18J . The fact that IR determined by this 
model has the microcanonical Gibbs distribution with k blocks and weight sequence 
Wj = j\ was obtained by Bayewitz et. al. [3j and Kingman |34j . 

Example 4. Cutting a rooted random tree [44| . Suppose the internal state of a cluster 
C of size j is one of the R* -1 rooted trees labelled by C. Then 2 2_1 ,3 3-1 ,...) 



Figure 2: Cutting a rooted random tree with 5 edges 

is the number of forests of k rooted trees labelled [n]. This time again there is a simple 
formula for B Hj k- As a consequence of Cayley’s enumeration of forests [42, 43] 

(14) 

The Gibbs model in this instance corresponds to assuming that all forests of k rooted 
trees are equally likely. This model turns up naturally in the theory of random graphs 
and has been studied and applied in several other contexts. The coalescent obtained 
by reversing the process of deleting the edges at random is the additive coalescent as 
discussed in [43] . 

3 Fragmentation Processes 

Recall that V [ n ] is the set of partitions of [n] := {1,... ,n}. Call a 7 ? [„|-valued random 
process (IR,f e /), with index set I a subset of real numbers, a fragmentation process 
if with probability one both 

(i) for every pair of times s and t in I with s < t the partition 11* is a refinement of 
IT;, and 
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(ii) for each 1 < k < n there is some t € I such that lit has k components. 

When a confusion is possible we will use the notation 11(f) if / is a continuous interval 
and lit i n the case where I is a discrete subset of the real numbers. We emphasize that 
throughout the paper, due to condition (ii), the fragmentation processes we consider 
are binary. In other words, whenever a split occurs, the split is a binary split in which 
one and only one block of the partition splits in two, thereby incrementing the number 
of components by 1. This condition also forces 11(f) to be the partition of [n] with one 
component of size n for all sufficiently small f G /, and to be the partition of [n] into n 
singletons for all sufficiently large f € I. 

Given a sequence of numbers (wi,u> 2 , ■ ■ ■, w n - 1 ), call (lit, f € I) a Gibbs fragmenta¬ 
tion process with weights (w\, ..., w n -\) if for every f £ I and 1 < k < n, the conditional 
distribution of lit given that lit has k components is the microcanonical Gibbs distri¬ 
bution p n ^ on Vy n u as defined by @. Note that if (II*,, k £ [n]) is a Gibbs fragmen¬ 
tation process, then the unconditional distribution of Lt*, is also p n ,k, because condition 
(ii) implies that lit, has k components with probability 1. Finally, the time-reversal 
(II n , II„_i,..., IIi) of any fragmentation chain (IIi,...,II n ), is called a coalescent. 

A basic problem, only partially solved in this paper, is the following: 

Problem 1. For which weight sequences (w \,..., w n -\) does there exist a V^-valued 
Gibbs fragmentation process with these weights? 

The above definitions were made in terms of -valued processes, as this formalism 
seems most convenient for computations with Gibbs distributions. Parallel definitions 
can be made in terms of P n -valued processes, using the partial ordering of refinement 
on V n defined as follows: for partitions A and p of n, A is a refinement of p if and only 
if there exist corresponding partitions X' and p! of [n] such that X' is a refinement of p!. 
Less formally, some parts of A can be coagulated to form the parts of p. The notions 
of Gibbs distributions and refining sequences transfer between V n and P[ n ] in such a 
way that the following results can be formulated with either state space. The rnany- 
to-one correspondence between partitions of the set [n] and partitions of the integer 
n, quantified by OB, provides a many-to-one correspondence between P n -valued and 
Pr n i-valued processes, in such a way that the partial ordering of refinement is preserved. 
Thus for a -valued Gibbs fragmentation process there is a corresponding Pn-valued 
fragmentation process, and vice-versa. 

To see that Problem Q] is of some interest, note that Example [3] (cutting a random 
rooted segment) provides a Pr n i-valued Gibbs fragmentation process with weights Wj = 
jl for each n. Example [I] (cutting a random rooted tree) does the same thing for the 
weights Wj = . What about for the sequence Wj = 1 of Example [l] (uniform random 

partitions) or the sequence Wj = (j — 1)! of Example [2] (uniform random permutations)? 
In these examples it is not obvious how to construct a Gibbs fragmentation process for 
n > 4. (Note that for n < 3 there exists a P[ n ]-valued Gibbs fragmentation process for 
arbitrary positive weights w± and W 2 , for trivial reasons.) The question for Wj = 1 is 
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largely settled by the following proposition, which can be traced as far back as m (see 
also EU). See Section [7] regarding Wj = (j — 1)!. 

Proposition 1. There is an no < oo such that for all n > n o there does not exist a 
V[ n ]-valued Gibbs fragmentation process (II k ,k € [n]) with equal weights w\ = W 2 = 

■■■ = Wn- 1- 

Proof. Let IIj n denote a random partition with the Gibbs distribution on P[ n ,k] with 
equal weights w\ = W 2 = • • • = w n - l, meaning that IL n)fe ] has uniform distribution on 
'P[n,k] • Let 

X( n,k,l ) ^ -^(n,fc,2) — ' " " — -^(n,fc,fc) (Ih) 

denote the sizes of components of II[ n ^ arranged in decreasing order. Then for each 
fixed i and k with 1 < i < k the zth largest component of has relative size X^ nki ^/n 
which converges in probability to 1/k as n —> oo. This follows easily from the law of 
large numbers, and the elementary fact that LL n y has the same distribution as II* k 
given that II* k has k components, where II* k is the random partition of [n] generated 
by n independent random variables U\, ..., U n each with uniform distribution on [&]. 
(So i and j are in the same component of II* k if and only if Ui = Uj .) In particular, 
there is an no < oo such that for all n > no both 

P(*(n,2,2) > (5/12)n) > 1/2 (16) 

and also 

P(X (n>3il) > (5/12)n) < 1/2 (17) 

But if (II[ raifc ],l < k < n) were a fragmentation process, then would be derived 

from n K2 ] by splitting one of the two components of II[ n)2 ], and hence X( nj2 2 ) < ^(n, 3 ,i) 
with probability one. Thus for a fragmentation process, (1161) implies the reverse of the 
inequality (fTTl) . and this contradiction yields the result. □ 


The above argument proves the non-existence for large n of a Gibbs fragmentation 
process for any weight sequence (wj) such that for k = 2 or k = 3 the components 
in the Gibbs partition of [n] into k components are approximately equal in size with 
high probability. For the weight sequence Wj = j\ of Example [3j what happens instead 
is that the sequence of ranked sizes (I15|) . normalized by n, has a non-degenerate limit 
distribution for each k. As observed in [341 §5], this limit distribution on [0, l] fc is the 
distribution of the ranked lengths of k subintervals of [0,1] obtained by cutting [0,1] at 
k — 1 points picked independently and uniformly at random from [0,1]. This asymptotic 
distribution has been extensively studied [22j. For the weight sequence Wj = j- 7-1 of 
Example HJ the behavior is different again. What happens is that for each fixed k 
the sequence of ranked sizes (1151) . when normalized by n, converges in probability to 
(1,0,... 0). That is to say, for any fixed k, for sufficiently large n, after k steps in the 
fragmentation process, there is with high probability one big component of relative mass 
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nearly 1, and k — 1 small components with combined relative mass nearly zero. To be 
more precise, it is easily shown that the k — 1 small components, when kept in the order 
they are broken off the big component, have unnormalized sizes X n j ,... ,X nk _\ that 
are approximately independent for large n with asymptotic distribution 

ji- 1 

lirn F(X ni = j ) = ——e 3 ( i,j = 1,2,...) (18) 

n— kx ) j ! 

which is the Borel distribution of the total progeny of a critical Poisson-Galton-Watson 
process with Poisson(l) offspring distribution started with one individual, which can 
be read from (ED and ED- See [331 §4.1] for proofs and various generalizations. As 
a consequence of (fT8l) and the asymptotic independence of the X n> \,..., X njk -i, the 
asymptotic distribution of the combined size X n> i + ... + X n ^~i of all but the largest 
component of the partition of [n] into k components is the distribution of the total 
progeny of the Poisson-Galton-Watson process with Poisson(l) offspring distribution 
started with k individuals, which is the Borel- Tanner distribution m 

U _ 1 m m-k+l -m 

lim P(T n i + ... + X n k _i = m) = - - - —— (19) 

n—>oo m (m — k + 1)! 

which can also be read from (1471) and (1491) . According to the classification of Barbour 
and Granovsky [3], the examples corresponding to Wj = j\, Wj = j 3 ^ 1 and Wj = (j — 1)! 
belong respectively to the expansive, convergent and logarithmic structures. These 
structures exhibit quite a different asymptotic behavior, which may account for the 
differences observed here between these three examples. 

As a contrast to Proposition [H it is known [50l fl5] that for any strictly positive 
sequence of weights {wj ), there is a reversible coagulation-fragmentation process on V\ n ] 
with the canonical Gibbs distribution ([2D as its equilibrium distribution. 

4 Existence of Gibbs fragmentation processes 

The problem of the existence of a Gibbs fragmentation (ITi,..., II n ) for a given integer 
n and weight sequence (uq,... ,w n - 1 ) is one of existence of an increasing process on 
a partially ordered set with contraints on the marginal distributions of the process. 
In principle, this is solved by the work of Strassen on measures with given marginals. 
See for instance [321 Theorem 1 and Proposition 4], According to this result, for the 
existence of a Gibbs fragmentation process it is both necessary and sufficient that for 
all A C V[ nt k], _ 

^ ~2Pn,k{ ?r) < ^ Pn,k+lix') (20) 

7rGA 7 t'gA' 

where A' is the set of partitions ir' that can be obtained by splitting a single block of 
some partition ir £ A, and p ntk is the Gibbs measure on partitions of [n] into k blocks. 
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A variation of this condition can also be given in terms of integer partitions rather than 
set partitions. Unfortunately, it seems hard to use this general criterion to prove any 
existence result. However m can be used as an algorithm to check the existence of a 
Gibbs fragmentation for a finite n and a given sequence (uq,... ,w n -i). For instance, 
when wj = 1 it is possible to check that the first n for which the existence of a Gibbs 
fragmentation fails is n = 20, as mentioned in m- 

If there exists any -valued Gibbs fragmentation process governed by (w \,..., w n - 1 ), 
then there exists one that is a Markov chain. For given a non-Markovian process, one 
can always create a Markov chain with the same one-step transition probabilities and 
the same marginal distributions. So Problem Q] reduces to: 

Problem 2. For which weight sequences (uq,..., w n -\) does there exist a transition 
matrix {P(ir,n)} indexed by Vy n \ such that P(tt,u) > 0 only if u is a refinement of tv, 
and if ve P^ k] 

Y Pn,fc-l(vr)P(7T, v) = p n , k (y) (2 < k < n) (21) 


where p n ,k{y ) given by the microcanonical Gibbs formula ([9]) ? 

Such a transition matrix P(n, v) corresponds to a splitting rule which describes for 
each 1 < k < n — 1 and each partition it of [n] into k — 1 components, the probability 
that 7r splits into a partition v of [n] into k components. Given that n^_i = Tr k -i 
with 7Tfe_i = {A^, ... , A' k l \ say, the only possible values n k of are those n k = 
{Ai,,A k } such that two of the Aj form a partition of one of the A), and the remaining 
Aj are identical to the remaining A). The initial splitting rule starting with 7iq = {[n]} 
is assumed to be specified by the Gibbs distribution p n ^ determined by the weight 
sequence (uq,..., ic n _i) for n\ and n ,2 with n\ + ri 2 = n. That is from (|9jk 


p(n 2 = {A 1 ,A 2 }) 


W #M W #M 

B n ,2(wi, . . .,W n -i) 


( 22 ) 


for B 7h2 as in (HOI) . The simplest way to continue is to use the following: 

Recursive Gibbs Rule: for each 1 < k < n — 1, given that n^_i = {A^,... ,A' k l } 
and that some particular block A £ {A \,..., A' k l } is split with ffA = m > 2, is 
obtained by splitting this block is split into {Ai, A 2 } with probability given by the right 
side of (|22l) with m instead of n. 

To complete the description of a splitting rule, it is also necessary to specify for each 
partition Ttk-i = {A),..., A^._ 1 } the probability that the next component to be split is 
A', for each 1 < i < k — 1. The simplest possible assumption seems to be the following: 

Linear Selection Rule: Given ir k _i = { A \,..., AI 1 }, split A) with probability pro¬ 
portional to ffA'j — 1, that is with probability (#A( — l)/(n — k + 1). 

While this selection rule is somewhat arbitrary, it is natural to investigate its im¬ 
plications for the following reasons. Firstly, blocks of size 1 cannot be split, so the 
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probability of picking a block to split must depend on size. This probability must be 0 
for a block of size 1, and 1 for a block of size n — k + 1. The simplest way to achieve 
this is by linear interpolation. Secondly, both the segment splitting model and the tree 
splitting model described in Examples [3] and [5] follow this rule. In each of these exam¬ 
ples a block of size m is a graph component with m — 1 edges, so the linear selection 
rule corresponds to picking an edge uniformly at random from the set of all edges in 

the random graph whose components define II*_i. Given two natural combinatorial 

examples with the same selection rule, it is natural to ask what other models might 
follow the same rule. 

More complex splitting rules are also of interest. Consider for instance a contin¬ 
uous time Markov fragmentation chain which fragments blocks of size j according to 
infinitesimal rates dictated by A jpj^ for some A j > 0. The embedded discrete time chain 
is then a recursive Gibbs fragmentation chain with the property that the probability 
to select a particular block of size n* for the next fragmentation is proportional to \ ni . 
But we do not know any nice description of the law of 11* in this case beyond saying 
that it is the solution of some Kolmogorov forward equations. See also |26| for theory 
of discrete and continuous time Markov fragmentation chains which are exchangeable 
and consistent as n varies, meaning that they can be associated with fragmentations of 
a mass continuum. 

4.1 Main results 

This section presents the main results, whose proofs are provided in the next two sec¬ 
tions. Recall the definition of the discrete Marcus-Lushnikov coalescent process on V\ n ] 
with affine kernel: this is the unique Markov chain on such that 7Ti is the partition 
consisting of singletons and ir^ is obtained from TTk-i by merging each pair of blocks 
of sizes i and j with probability proportional to K,j = a + b(i + j) for some constants 
a and 6. In the case a = 1 and 6 = 0 this is Kingman’s n-coalescent, as described in 
Example 3 (blocks coalesce at rate 1), while if a = 0 and 6=1 this is the additive 
coalescent mentioned in Example 4. 

Theorem 2. Fix n > 4, and let ( Wj , 1 < j < n — 1) be a sequence of positive weights 
with w\ = 1. The following two statements are equivalent: 

(i) The V^-valued fragmentation process (IT*, 1 < k < n) defined by the recursive 
Gibbs splitting rule derived from these weights, with the linear selection rule, is 
such that for each 1 < k < n the random partition IR has the microcanonical 
Gibbs distribution p n ^ with the same weights. 

(ii) The weight sequence wj is of the form 

j 

Wj = w b .’ c ■= Y[(ic + jb ) , (j = 2,..., n - 1) (23) 

i =2 
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for some real b and c such that 

b + c> 0 and either b > 0 or b < 0 and c > — (n — 1 )b/2. (24) 

(iii) The time reversal of (11^,2 < k < n) is a discrete Marcus-Lushnikov coalescent 
with affine kernel K, j = a + b(i + j). Moreover, in this case the b is the same as 
in (ii) and a = 2c. 

Note that c and b appearing in (ii) and (iii) are unique only up to a constant common 
factor. 

Given a continuous time "P[ n ]-valued coalescent or fragmentation process (n(t),t e 
I), define the discrete skeleton of (II (t),t € I) to be the TGi-valued process (II£,1 < 
k < n) where IIjj) is the common value of II(t) for all t £ I such that ffH(t) = k. 
Provided either b > 0 or b < 0 and c > —rib /2 the time-reversed process in part (iii) of 
the above theorem is the discrete time skeleton of the continuous time affine coalescent 
with collision rate kernel K t ^ := 2c + b(i + j). As observed by Hendriks et al. [28], 
this kernel has the special property that the process (ffU(t),t > 0) is independent of 
the discrete skeleton of (n(f),f > 0) (in the special case of Kingman’s coalescent this 
had been proved earlier in [34] ). Thus Theorem [2] implies the result of Hendriks et al. 
[28] that for an affine coalescent in continuous time the distribution of n< is a Gibbs 
distribution with weights w b f c as in (12.11) . that is a mixture over k, with mixing weights 

depending on t, of the microcanonical Gibbs distributions p b )° k featured in Theorem [2j 
The fact that (n (t),t > 0) is a Gibbs coalescent with a particular sequence of weights 
Wj is related in this instance to the fact that its discrete skeleton is a Gibbs coalescent 
with the same weights. But this equivalence relies on the independence of the process 
(ffU(t),t > 0) and its discrete skeleton. It is not always true that the discrete skeleton 
of a continuous time Gibbs coalescent is a discrete time Gibbs coalescent, as illustrated 
by example in Section [7] 

The following corollary was suggested by comparison of Theorem [2] with the branch¬ 
ing process interpretation of Bell polynomials provided in Section 15.21 To simplify the 
argument we introduce a regularity condition on the offspring distribution (1251) , but this 
assumption may not be strictly necessary for the result to stay valid. 

Corollary 3. Fixn > 4. Let T denote a Galton-Watson tree with offspring distribution 
( pj ) such that 

Pj > 0 if and only if 0 < j < j\ for some 1 < j\ < oo. (25) 

Let F\ be T conditioned to have n nodes, regarded as a random plane tree (a tree with 
ordered branches), and for 2 <k<n let Fk be the plane forest of k trees obtained by first 
cutting k — 1 edges of F\ picked by a process of random sampling without replacement, 
and then putting these k trees in random order, with all k\ orders equally likely, where 
the cutting and ordering processes are independent of each other and of F\. Then the 
following conditions are equivalent: 
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(i) The offspring distribution is such that 


Pf 

Po 


if[(6-(i-2) c ) 

J' 4—1 


( 26 ) 


for some real parameters b and c with b + c > 0 and such that the product is 
non-negative for all 1 < j < n — 1. 


(ii) The forest of two trees F 2 is distributed like two independent copies of T condi¬ 
tioned to have a total of n nodes. 

(iii) For every 1 < k < n the forest of k trees F^ is distributed like k independent copies 
of T conditioned to have a total of n nodes. 

For such a sequence of forests (Ff.. 1 < k < n) let (11^, 1 < k < n) be the refining 
sequence of partitions of [n] defined by labelling the n nodes of tree F\ by a random 
permutation independent of F\, and letting the blocks of II*, be the tree components of 
Ffr. Then 


(iv) the sequence of partitions (Iii,..., II n ) develops by recursive Gibbs fragmentation 
with linear selection, for the weight sequence {w b f c ) as in (EH), and (n n ,...,IIi) 
is a Marcus-Lushnikov coalescent with the affine kernel Kjj = 2c + b(i + j). 

The implications (i) => (ii) => (iii) =>■ (iv) of this Corollary were provided in [43] for 
the case c = 0, when the offspring distribution can be Poisson with mean b for any b > 0. 
Note that (i) only specifies the conditional offspring distribution given at most n — 1 
children, as is necessary for the converse for a fixed n. The conditions on b and c imposed 
in (i), which are necessary for construction of the forest-valued fragmentation (i 7 *.), imply 
but are not implied by the conditions (|24l) which are necessary for construction of the 
partition-valued fragmentation (life). To illustrate for n = 4, the conditions (1241) are 
that b+c > 0 and 36+2c > 0, whereas those in (i) above are b+c > 0 and either b—c > 0 
or 6 = 0. In either case, 6 > 0, hence 36 + 2c > 0, but not conversely. The 6 and c such 
that the conditions (1241) hold for all n are those with 6 > 0 and 6 + c > 0. Whereas there 
is the forest-valued representation for all n if and only if one of the following further 
conditions holds, as discussed later in Section 15.21 

• c = 0: the offspring distribution is then Poisson(6); 

• c > 0 and 6 = (o — l)c for a positive integer a: the offspring distribution is then 
binomial(a,p) for p = c/(c+ 1); 

• — 1 < c < 0 and 6 = (a — l)c with — a = r > 0: the offspring distribution is then 
negative binomial(r,p) for p = c + 1. 
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The cases when a is an integer admit further combinatorial interpretations, which we will 
discuss in more detail elsewhere. For instance, when a = — 1 we obtain a representation 
of the affine coalescent with collision kernel Kjj = i + j — 1 by time reversal of a process 
of coalescent plane forests (F k , 1 < k < n) , where the forest with k trees has the uniform 
distribution on the set of 

k /2n — k — l\ 
n\ n—k J 

plane forests with k trees. And when a is a positive integer, there is an interpretation 
of the (a — l)(i + j ) + 2 coalescent in terms of trees where each node has either 0 or a 
children. 


5 Preliminaries 


5.1 Generating functions 


Let w(z) := w n z n /n\ be the exponential generating function associated with the 

sequence of weights (w n ). It follows easily from (TlQl) that 

771 , 

B ntk (w 1 ,w 2 ,...) = -[z n ]w(z) k (27) 

where [z n ]w(z) k denotes the coefficient of z n in the expansion of w(z) k in powers of z. 
In particular, 

1 n_1 (n 

B n ,2{w\, ■ ■ ■ ,w n - 1 ) = 2 ^ 1 / 

1 i=i '■ 

Assuming the weights are such that w(£) < oo for some ^ > 0, the formula 


WlWn-l- 


(28) 


P(y = n) 


Wnt n 

n\w(£) 


defines the distribution of a non-negative random variable Y whose probability gener¬ 
ating function is 


E(z Y ) = w(£z)/w(£). 


(29) 


If Y\,Y 2 ,... is a sequence of independent random variables with the same distribution 
as Y, then 


P {Y l + ---Y k = n) 



n\w(£) k 


(30) 


which appears for instance in (1.3.1) of [36j and [TTl Lemma 3.1]. This implies the 
Kolchin representation of block sizes in a Gibbs partition [HI Theorem 1.2]: for a random 
partition of [n] with the microcanonical Gibbs distribution p n ^ derived from (vij), when 
the k blocks are put in a random order, with each of k\ possible orders equally likely, 
independently of the sizes of the blocks, the sequence of block sizes is distributed as 


Oh, • • •, Y k ) given Yi H- Y k = n. 


(31) 
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Easily from (1271) there is the exponential formula 

OO 

e xw(z) = J2c n (x)z n (32) 

71=0 

where Co (a;) = 1 and C n (x) for n = 1, 2,... is the polynomial 

71 

C n (x) = (n!) -1 ^ B njk (wi,w 2 , ■ ■ ■)x k . 

k =1 

The polynomials C n (x) are then of convolution type , meaning that for n > 1, 

71 

C n (x + y) = C k (x)C n - k (y). (33) 

fc =0 

Assuming now that uq = 1, let ro^ 1 ) denote the compositional inverse of to defined by 
ujC 1 ) (w(z)) = z. According to the Lagrange inversion formula [ 361 Theorem 5.4.2] 

= (34) 

where C n (x) is the sequence of polynomials of convolution type defined by 



Combining (1271) and (1341) we obtain the following lemma. See also Knuth [38] for a 
similar discussion. 

Lemma 4. Each sequence of real weights (w\, w 2 , ■ ■ ■) with w± = 1 admits the repre¬ 
sentation 

w n = (n- l)!C n _i(n) (36) 

for a unique sequence of polynomials C n (x) of convolution type, namely that determined 
by (1351) . in which case for n > 1, 

(n - 11! - 

B n ,k( 1, w 2 , w 3 ,.. .) = —--TrC n _fc(n). (37) 

Many sequences of polynomials of convolution type are known m Examples 2.2.16], 
each providing a sequence of weights (w n ) for which the Bell polynomials can be explic¬ 
itly evaluated using (11171) . As a general rule, weight sequences with manageable formulas 
for the B n j. are those with a simple formula for z/w <y ^ 1 \z) rather than for w(z). A 
rich source of such examples is provided by the theory of Galton-Watson branching 
processes. 
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5.2 Branching Processes 


Given a weight sequence (wj) with w\ = 1 and exponential generating function w(z ) = 
Y^n>i w nZ n /n\, let 


G(z) 


z 

u;( _1 )(z) 


72—0 


(38) 


where Co(l) = 1. Then provided 

C'n(l) > 0 for all n > 1 and ^( 77 ) < 00 for some > 0 


(39) 


the formula 


g(z) 


G(zrf) _ y, d n (l)rj n 
G(V) ^ G{rj) 


(40) 


defines the probability generating function of a non-negative integer valued random 
variable X with distribution 


P(A ’ = " ) = ^f (n = 0,1,2,...) (41) 

Conversely, for each distribution of X with P(A = 0) > 0 and each > 0 it is easily 
seen that there is a unique sequence of convolution polynomials C n such that (14111 holds. 
This is a particular case of m Theorem 2.1.14], Let Y be the total progeny in a Galton- 
Watson branching process with generic offspring variable X. It is well known that the 
probability generating function 


OO 

h(z) := = n)z n 

72—1 


(42) 


can be characterized as the unique solution of the functional equation 


h(z) = zg(h(z )) 


(43) 


which is obtained by conditioning on the number of offspring of the root individual 
[441 Section 6.1]. Note in particular that given the generating function h of the total 
progeny, the offspring probability generating is determined by 

- 5 !=%) <44) 

Also, 

h{ 1) = 1, meaning P(Y < 00 ) = 1, 

if and only if mean of the offspring distribution is at most 1, that is by (14011 


9 '{ 1) = vG'(r])/G(ri) < 1. (45) 

Note that the assumed form m of the offspring distribution forces P(X = 0) > 0 
(since Co(l) = 1), and so forbids the degenerate case with P(A = 1) = 1. Combining 
this discussion with Lemma [4] we obtain: 


18 







Proposition 5. Let w\ = l,u> 2 ,... be a sequence of non-negative weights with expo¬ 
nential generating function w(z) := X/^Li w n z n /n\ and let w/ -1 ) be the compositional 
inverse of w defined by w~ 1 (w(z)) = z. The following two conditions are equivalent: 


(i) there exists f > 0 such that w (£) < oo and the random variables Yj, in Kolchin’s 
representation m of Gibbs partitions, with generating function w{ztf)/w(f), are 
distributed like the total progeny of some Galton- Watson branching process started 
with one individual. 


(ii) The power series 


G(z) 


z 

W ( '~ 1 \z) 


n =0 


has non-negative coefficients C n (l) and G{rj) < oo for some rj > 0. 


When these conditions hold, the offspring distribution is as displayed in m, with gen¬ 
erating function g(z) = G(rjz)/G(r]) for r] = w(f), and g must satisfy g'( 1) < 1. The 
associated evaluation of Bell polynomials is then 

BnA 1, w 2 ,w 3 ,...)= y n—±C n - k {n) = + --- + Y k = n) (46) 

where C n (x) := [ z n ]G(z) x and Y\ + • ■ • + Y k represents the total progeny of the branching 
process started with k individuals. 

Proof. Condition (i) is that w(zf)/w(f) = h(z) where h is derived from some probability 
generating function g via (1431) . Let h~ 1 denote the compositional inverse of h, defined 
by h(h~ l (z)) = z. The equation h(z) = v is solved by z = w^ 1 )(vw(£)) /£, so using 
(fill) g is recovered as 

v vf f vw(f) G(j]v) 

9 h^fv) w(~ 1 )(vw(£)) w(£) w(~ 1 )(vw(^)) G{rj) 

where rj = w(£) so that G{rj) = G(w(f)) = The rest is read from 

Lemma UJ □ 


The conditions of the previous proposition force the branching process to be critical 
or subcritical. For arbitrary with G(rj) < oo, and a branching process with offspring 
generating function g(z) := G(zr))/G(rj), the Lagrange inversion formula shows that the 
distribution of the total progeny of the branching process started with k individuals is 
given by the formula 

P(Li + ... + Yk = n) = [z] n h(z) k = ^[z n - k }g(z) n = l^±C n - k (n) (47) 
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where the Y % are independent and identically distributed according to this formula for 
k = 1. Formula (1471) can be rewritten using LemmaH as 

f n kl 

P(Yi H- Y k = n) = -r—B nk (l,w 2 ,w 3 ,...) (48) 

r] L n! 

which is also consistent with (1301) . Here £ := rj/G(r]), and necessarily rc(£) < r], with 
w(£) = r] and P(Hi + ■ ■ ■ + Y k < oo) = 1 only in the critical or subcritical case g'{ 1 ) < 1 . 
To illustrate these results, consider first the generating function G(z ) = e bz so that 


G{z) x = e bzx = 


b n x r 


71=0 


n\ 


The associated sequence of convolution polynomials is 

C n {x) = b n x n /n\. 


The convolution identity (1331) is the binomial theorem. The corresponding weight se¬ 
quence is 

w n = (n- 1 )\C n -i{n) = b n ~ 1 n n ~ 1 
and the Bell polynomial evaluation is 

= jB§ e ’- k(n) =( l ~~!) <49> 

as indicated earlier in (1141) . The branching process interpretation is that for Poisson 
offspring distribution with mean 6, the distribution of the total progeny of the branching 
process started with k individuals is given by formula (1471) with the above substitutions 
for ?7 = 1 and £ = 1/G(1) = e~ b . 

Consider next the generating function G(z) = (1 + cz) a for some pair of real param¬ 
eters a and c, so that 


G(z) x = (1 + cz) ax = 


n =0 


ax 


n 


The associated sequence of convolution polynomials is 

«•<*> = (“ ) c ” 

In this case, the convolution identity (1331) is called the Chu-Vandermonde identity (see, 
e.g., [25]). The corresponding weight sequence is 


w n = (n- l)!C n _i(n) = (n - 1)!( ^ ^ \ c n 1 
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(50) 




and the Bell polynomial evaluation is 


Bn,k 


(n — 1)! 


Gn—k ijl) 


( n ~ X ) ! ( an \ n-k 

(.k — 1)! \n — k) 


(51) 


Two cases of this formula have well known probabilistic interpretations mmm , as 
indicated in the next two paragraphs. 

If a is a positive integer and c > 0, then C n ( 1) > 0 for all n. For rj = 1 the probability 
generating function (TTUli is 


# 0 ) 


G(z) 

G( 1 ) 


1 + cz 
1 + c 


{q+pz) a 


for p := c/(l + c) and q := 1 —p. This represents the binomial distribution with param¬ 
eters a and p. For a = 1 the evaluation (T5TT) reduces to the previous evaluation (fl3l) of 
the Lah numbers. The branching process in this case is a rather trivial one, with each 
individual having either 0 or 1 offspring. So the random family tree is just a line of 
vertices whose length is geometrically distributed. Cutting the edges in such a segment 
of random length by an independent process of Bernoulli trials yields a geometrically 
distributed number of components, which given their number have independent and 
identically distributed lengths with another geometric distribution. According to Corol¬ 
lary [3] a similar interpretation of the microcanonical Gibbs distributions with weights 
(1501) can be provided in terms of random cutting of edges of a Galton-Watson tree both 
in the case of binomial(o,p) offspring distribution for a = 1, 2, 3,..., and in the following 
case of negative binomial offspring distribution. 

If a = — r and c = —q for r > 0 and 0 < q < 1, again C n ( 1) > 0 for all n. For rj = 1 
the probability generating function (1401) is 


9(z) 


G{z) 
G{ 1) 


/I + cz 
v 1 + C 


—r 


1 ~q 

1 — qz 


r 


which is the generating function of the negative binomial distribution with parameters 
r > 0 and p = 1 — q = 1 + c € (0,1). 

It is easily seen that the coefficients (°)c n are non-negative for all n only in the two 
cases just discussed. So only in these cases does the Bell polynomial (15T1) admit the 
interpretation of Lemma [5] in terms of the total progeny of a branching process for all 
n. Still, the weights w n in (1501) are non-negative for other choices of real a and c, for 
instance a > 1 and c > 0. These weights still define a Gibbs distribution on partitions, 
and there is the Kolchin representation (1311) for the sizes of blocks of such a partition. 
A natural probabilistic construction of such random partitions is provided by Theorem 
[2j The interesting intermediate case, when the coefficients (“)c ? are non-negative only 

for j < j i for some j\ < oo, corresponds to Corollary [3j Then C n { 1) can be set equal 
to 0 for j > j i, and the previous branching process formulas remain valid provided n is 
restricted to n < j\. 
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5.3 Evaluation of a Bell polynomial 

The results of the last two Bell polynomial evaluations (1491) and (1511) . which are implicit 
in the standard theory of branching processes, are unified algebraically by the following 
lemma. The evaluation (1541) is also implicit in [28l (19)-(21)], and plays a key role in 
our treatment of Gibbs models for fragmentation processes. 

Lemma 6. For each pair of real parameters b and c, the polynomials 

^ 71—1 

C'n C ( x ) = ^7 n ( hx + cx ~ °j) ( 52 ) 

n ' j=o 

are of convolution type. For the corresponding weight sequence 

n 

Wn c ■= (n - ly.C^in) = JJ(ic + nb) (53) 

i =2 

there is the Bell polynomial evaluation 

B n,k{ 1, w b 2 ", w b f c ,.. .) = (n - k)\C b n c _ k (n) = | ] (ic + nb). (54) 

^ ' i=k +l 

Proof. This is read from the previous example with generating function G(z) = (1 +cz) a 
for a = b/c + 1. The limiting case c = 0 corresponds to G(z) = e zb . □ 


It is convenient to record here as well an immediate consequence of (1541) : 

Lemma 7. The sequence of weights w n = Wn C is the unique solution of the recursion 
w\ = 1, W 2 = 26 + 2c, and 


w n = — + nb B ni2 (wi,.. .,w n -i) (n = 2,3,...) 


(n- 1) 


(55) 


for B n 2 as in 


6 Proofs 

6.1 Proof of the main result 

The proof of Theorem [2] is based on the next two lemmas. 

Lemma 8. Fix n > 4 and 3 < k < n — 1, and let (rh_i,rh) be a pair of random 
partitions of [n] such that rh_! is distributed according to the microcanonical Gibbs 
distribution p n ,k -1 with weights w\ = 1, W 2 , ■ ■ ■, w n , and 11^ is derived from IIfc_i by 
the recursive Gibbs splitting rule with these weights, and the linear selection rule. The 
following two conditions are equivalent: 


22 



(i) Ilk has the microcanonical Gibbs distribution p U)k with the same weights. 


(ii) The function 


f M 


(m - 1 )w m 

Bm,2(wu ■ . . ,W n - 1 ) 


(2 < m < n — 1) 


(56) 


satisfies 

^2 f(rii + nj) =g(n,k) (57) 

l<z<7</c 

for all sequences of k positive integers (n±,... ,n k ) with Yli=i n i = n an d some 
function g(n , k). 


When these conditions hold, 

g(n,k)= ( " ~ * + 

-^n,k 


(58) 


and the reverse transition from II*. to IIfc_i is governed by the Marcus-Lushnikov coag¬ 
ulation mechanism with kernel Kij = f(i + j). In the case k = 3 these conditions are 
equivalent to 

f(m) = 2c + mb for all 2 < m < n — 1 (59) 

and hence to Wj = w b f c as in (f23l) . for some real b and c. 

Proof. Let n k denote any particular partition of [re] into k blocks, say {Ai,..., A k } with 
ffA % = n t , 1 < i < k. For 1 < i < j < n let n'ff , be the partition of [n] into k — 1 blocks 
derived from {A±,..., A k } by merging of Ai and Aj. The hypothesis of the lemma 
implies that 


P(n fc -i = vr^i^II k 


7 ^) = P(n fc _i = 


( Tlj T Ttj — 1) W ni W nj 

(n-k + l) B ni+nj}2 


and that 

mk~i = 4 J -i) 


Substituting (16TT) into (1601) gives 


w nj+nj n,= l w ni 
Bn,k— 1 W r nW n j 


P(n fc _i = n^Uk = TTfc) 


f{ni + nj)]Xi = i w ni 

(n. k + T)B nk _l 


(60) 


(61) 


(62) 


for / derived from the weights as in (1561) . Summing this probability over all possible 
choices of (i.j) with 1 < i < j < k yields P(IIfc = ir k ), so the equivalence of conditions 
(i) and (ii) is clear. Assuming these conditions hold, (1581) follows at once: dividing (1621) 
by the Gibbs formula for P(IIfc = Tr k ) gives 


P(n fc _i = 7r^ 1 |II fc = -K k ) = f(ni + nj)/g(n,k). 
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as claimed. In the case k = 3, we deduce (1591) from the following lemma, and the weights 
are then determined by Lemma 0 □ 


Lemma 9. Fix n > 3 and let (/(m), 2 < m < n — 1) be a sequence such that for every 
triple of positive integers (rai, n 2 , n 3 ) with ri\ + n 2 + n 3 = n 

f(n 1 + n 2 ) + / (n 2 + n 3 ) + f(n 1 + ra 3 ) = C (63) 

for some constant C. Then there exist constants b and c such that f(m) = 2c + mb for 
every 2 < m < n — 1, and C = 2(3c + nb). 

Proof. For n = 3 or n = 4 the conclusion is trivial, so assume n > 5. Since /(m) is 
defined only for 2 < m < n — 1, it is enough to show that 

fil) ~ f(l - 1) = f(l - 1) - f(l - 2) for all 4 < l < n — 1 (64) 

Let i be the integer part of Z/2 and j = l — i. Then i >2 and either j = i or j = i + 1, 
so j > 2 too. Write EQ(ni, n 2 , n 3 ) for the equation (1631) determined by a particular 
choice of (ni,n 2 ,n 3 ). Keeping in mind that Z = z + j, we have 

EQ(i — l,j — l,n — l + 2) : f(l-2) + f(n-i + l) + f(n-j + l) = C (65) 


EQ{i -l,j,n-l + l) : f{l-l)+f{n-i + l) + f{n-j) = C (66) 

EQ(i, j - l,n - l + 1) : /(Z - 1) + f(n - i) + f(n - j + 1) = C (67) 

EQ(i,j, n — l): f(l) + f{n-i) + f{n-j) = C (68) 

Subtract (1551) from (l66l) to obtain 

/(Z - 1) - /(Z - 2) = f(n -j + 1) - /(n - j) (69) 

and subtract (1671) from (1681) to obtain 

/(/) - f(l - 1) = f(n -j + 1) - f(n - j) (70) 

and Lemma [9] follows. □ 


We can now finish the proof of Theorem 0 Fix n > 4, let (w 3 .1 < j < n — 1) be a 
sequence of positive weights with w\ = 1. 

Suppose first as in condition (i) of Theorem 0 that (Lf^, 1 < k < n) is a -valued 
fragmentation process defined by the recursive Gibbs splitting rule derived from these 
weights, with the linear selection rule, and that the distribution of LL is p Ut k for every 
k. Then condition (ii) of Lemma [8] holds for all 3 < k < n — 1, and in particular for 
k = 3. Lemma [9] now forces (1591) for some b and c, hence Wj = w b f c by Lemma 0 
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Conversely, suppose that (11^, 1 < k < n ) is a "P^-valued fragmentation process 
defined by the recursive Gibbs splitting rule with the weights Wj = w b,c , and the linear 
selection rule. Lemma [7] implies that (|59l) holds, so it is clear that condition (ii) of 
Lemma [8] holds for 3 < k < n — 1 with 

g(n,k) = (k — \){kc + nb). (71) 

Consider the inductive hypothesis that LR_i has the microcanonical Gibbs distribution 
7 J n fc-i with these weights (ui b ’ c ). This is true for k = 3 by assumption. Assuming it true 
for some k, Lemma [8] provides the inductive step from k to k + 1. Thus the distribution 
of ITfc is p^° k for every 2 < k < n — 1. Thus condition (i) of Theorem [2] is satisfied by 

the weights Wj = w b,c . 

Condition (iii) of Theorem [2l that the reversed process is an affine coalescent, is now 
read from the last sentence of Lemma [HJ 

6.2 Proof of Corollary [3] 

Recall first that the distribution of an unconditioned Galton-Watson tree, restricted to 
finite trees, is given by the formula 

P(T = t) = 7T (t) := JJ p n{Vtt) (72) 

v£V{t) 


where 

• t denotes a generic plane tree with a finite number of nodes 

• V ( t ) is the set of nodes of t\ 

• n(v, t ) is the number of children of node v of f; 

• p n is the probability that a node has n children; 

The nodes of t are regarded as unlabelled. But the tree has a root node, and the children 
of each node are assigned a total order, say from left to right. So the nodes of t can 
be identified or listed by some arbitrary convention, such as depth first search, and any 
such convention can be used to rigorously identify the set of nodes V(t) as a subset of 
some ambient countable set. See [42[ 43j for background. Fix n > 4. By definition, F\ 
is T conditioned on = n, so 

P(Fi =t) = 7r(t)l {#t=n} /q(n) (73) 

where q(n) is by definition the probability that T has n nodes: 

q(n) ■= P(#T = n)=J2 n(t)l {#t=n} =Pq^-- (74) 

t n ' 
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In the last formula, read from ( 1481 ) . the weight sequence ( w n ) with w\ = 1 is determined 
as in ( 1441 ) by its exponential generating function w n z n /n\ which is the compositional 
inverse of zpo/g(z) for g(z) := Yl^=oPi z ^ the offspring generating function. The prob¬ 
ability ( 1741 ) is strictly positive for every n > 1, by the simplifying assumption ( 1251 ) on 
the offspring distribution. 

Let F ‘2 be the plane forest of two trees obtained by splitting F\ by deletion of a 
uniformly chosen random edge of F\ , with subtree containing the root put to the left, 
and the remaining fringe subtree put to the right. Then the distribution of F 2 is given 
by the following formula: for a generic pair of plane trees 

P (-^2 q(n)(n — 1) ^f# tl +# t2=n } 


where 


with 


—^ ) ■ 'y 1 r n(v,t) 
vGV (t) 

r m := (m + l)p m+l /p m (0 < m < n - 2) 


(76) 


and the particular offspring distribution display in (1261) is characterized by the formula 


r m = b — (m — l)c (0 < m < n — 2). 


(77) 


Formula (1751) is obtained by conditioning on which vertex v of t\ is the one to which £2 
is attached in t, and given that v has m + 1 children in t, which of these m + 1 children 
is the root of t\. Tossing a fair coin to decide the order of trees in F 2 then yields F 2 
with distribution 


IP(-P2 = 


7r(fi)7r(f2)(S(ti) + T,(t 2 )) 
2 q(n)(n — 1) 




(78) 


On the other hand, the distribution of F 2 * defined by two independent copies of T 
conditioned to have a total of n nodes is given by 

f m = (tuh)) = ^M lj#h+#li . n) (79) 

where 

ir—2 

<?2 W = ^ q{m)q{n - m) = p %—^ 

Z —' n\ 

m= 1 

gives the distribution of the total progeny of the branching process started with two 
individuals, as indicated in (1481) . with B n ^ w 2 , ■ ■ ■, w n -i). Condition (ii) of 

Corollary [2 is the equality in distribution 


F 2 = F*. 


(80) 
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It is clear from (fTSl) and (1791) that this equality in distribution is equivalent to the 
identity 

£(t 1 ) + £(t 2 )= ( "~ 1) “’" (81) 

-On, 2 

for all pairs of trees (ti,t 2 ) with 7r(t 1 )7r(t 2 )l(#ti + #£2 = n ) > 0, where S(t) := 

J2vev(t) r n(y,t) for r m := (m + l)p m+1 /p m . 

(i) =>• (ii). If (i) holds then r m = b + c — me and hence 

E(ti) + S(t 2 ) = nb + nc — (n — 2 )c = nb + 2c (82) 

because in every forest of two trees with n nodes the sum of the numbers of children of 
all nodes is the total number of edges, which is n — 2, and (181 [1 is now read from (15511 . 

(ii) => (iii) and (iv). Assuming (ii), it follows immediately from ([75]). (179|) and (1801) that 
for k = 2 the distribution of life generated by random labelling of tree components of 
Ffe has the Gibbs distribution p n ^ with whatever weight sequence (uy c ) is associated 
with the distribution of the total progeny of the branching process, and that condition¬ 
ally given life the k plane trees associated with these components are distributed like 
independent copies of T conditioned to have the sizes dictated by the block sizes of 
life. Suppose inductively that this is so for some k > 2. The process of random edge 
deletion induces the linear selection rule for components to split, and given that a tree 
component is split, the inductive hypothesis and the assumption for k = 2 implies that 
the component is split into two independent copies of T conditioned to have the right 
size. The implication (iii) =>• (i) of Theorem [2] now provides the inductive step. 

(iii) =4* (ii) is trivial. 

(ii) =>• (i). This follows easily from the identity (|81l) and the following Lemma: 

Lemma 10. Fix n > 3. Let r(m ) be a real-valued function with domain S = {0,1,... j} 
for some 1 < j < n — 2, such that 


J2 r ( n i) = C 

i =1 

for some constant C for each choice of (m,... ,n n ) with n t £ S for all 1 < i < n and 
Y17 =i n i = n — 2. Then there exist real a and b such that r(m) = am + b for all m £ S. 

Proof. Consider for each 1 < m < j — 1 the sequence (ni ,,n n ) with the first n — m — 2 
terms equal to 1, the next term equal to m, and the last m + 1 terms equal to 0. This 
sequence gives 

(n — m — 2)r(l) + r{m) + (m + l)r(0) = C 

and the same holds for m + 1 instead of m. The difference of these two identities gives 

r(m + 1) — r(m) = r(l) — r(0) 


and the conclusion follows. 


□ 
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7 Gibbs fragmentations for random permutations in con¬ 
tinuous time 

Given a symmetric non-negative collision rate function Kj 3 defined for positive integers 
i and j, call the V\ n \ -valued continuous time parameter Markovian coalescent process 
(n t ,t > 0), in which each pair of clusters of sizes i and j is merging at rate Ki.j, 
the Marcus-Lushnikov coalescent with collision kernel Kij- See [1] for background. 
It is assumed throughout this section, in keeping with the definition of a coalescent 
process given in the previous section, that such a coalescent process is started with the 
monodisperse initial condition. That is to say IIo is the partition of [n] into n singletons. 
Both Marcus and Lushnikov worked with the corresponding ^-valued process rather 
than a P[ n ]-valued process, but there is no difficulty in translating results from one 
state-space to the other, by application of the standard criterion for a function of a 
Markov process to be Markov. Lushnikov [39] found the remarkable result that for a 
collision kernel of the form Kj j = if(j) + jf(i ) for each t > 0 the distribution of Ilf is 
of the form 


n 



(83) 


k=1 


where Pn,k{^",Wj,j = 1,2,...) denotes the microcanonical Gibbs distribution on V\ n M 
with weights w 3 , and the functions q n ^{t) = JP(#IIf = k) and the weights Wj(t) are 
determined by a system of differential equations. As mentioned earlier, Hendriks et al. 
[ 28 ] showed that for LQj = a + b(i + j) for constants a and b the wj(t) can be chosen 
independently of t as Wj(t) = Wj where Wj is determined by a and b via formula (f23l) 
for c = a/2. 

Kingman [ 34] studied the particular case of the Marcus-Lushnikov coalescent with 
a = 1 and 6 = 0. In this process, at any given time t, given that ffllt = k, each of the 
k{k— l)/2 cluster pairs in existence at time t is merging at rate 1. Call this process with 
state space "P[ n j Kingman’s n-coalescent, Motivated by applications to genetics, Kingman 
[34 ] proposed the following construction. Given a coalescent process (n t , t > 0), suppose 
that each cluster of Hf is subject to mutation at rate 9/2 for some 9 > 0. Now define a 
random partition of [n] by declaring that i and j are in the same block of if and 
only if no mutation affects the clusters containing i and j in the interval (0, t 13 ) where 
Tij is the collision time of i and j in the coalescent process (Hf,t > 0), that is the first 
time t that i and j are in the same cluster of Hf. Kingman obtained the following result: 

Proposition 11. (Kingman [35]) Suppose that (Hf,t > 0) is Kingman’s n-coalescent. 


Then 



(84) 


for each partition n of [n] into k components of sizes n \,..., 
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The distribution of lie defined by (1811) first appears in m and is known as Ewens’ 
Sampling Formula with parameter 9. This distribution has long been recognized as an 
essential tool in population genetics (see, e.g. m recently), and has been applied in 
a wide variety of contexts in probability. Note that this distribution is a particular 
mixture over k, with mixing coefficients depending on 9, of the microcanonical Gibbs 
distributions on TT*.] with weights (j — 1)!, as interpreted in Example El Recall that 
this distribution has been constructed starting from (IR,i > 0), where due to Example 
[3l for all t, n t is a mixture over k, with mixing coefficients depending on t, of the micro- 
canonical Gibbs distributions on < k < n, with the different weight sequence 

(j !, j > 1). It does not seem obvious from a combinatorial perspective why there should 
be such a connection between the Gibbs models with these two weight sequences. 

The random partition lie (which is sometimes referred to as the random allelic 
partition), and more generally the fragmentation process (IT,0 > 0) discussed below, 
can be defined starting from any coalescent (IR), but there seems to be a manageable 
formula for the distribution of rig only for Kingman’s coalescent. See however the recent 
work of Mohle [H] where an explicit recursion is given for the random allelic partition 
obtained from a A-coalescent (i.e., coalescent with multiple collisions). See also the 
related work of m as well as M which has some explicit asymptotic formulae in the 
particular case of a beta-coalescent. 

As a development of Kingman’s result, there is the following proposition. See also 
[23] (or [T3, Exercise 5.2.1]) for an alternative construction. 

Proposition 12. There exists a Gibbs fragmentation process (IR,0 > 0) with weight 
sequence (( j — 1)!, j > 1) such that for each 9 > 0 the distribution of IR is the Gibbs 
distribution on with these weights as displayed in (IM1) . 

Proof. Given the path of a Kingman coalescent process (IT, t > 0), construct a random 
tree T as follows. Let the vertices of the tree T be labelled by the random collection 
V of all subsets of [n] which appear as clusters in the coalescent at some time in its 
evolution. Because the coalescent develops via binary mergers, starting with n singletons 
and terminating Ilf = [n] for all sufficiently large t, the set V comprises the collection 
of all n singleton subsets of [to], which are the leaves of the tree, the whole set [to] which 
is the root of the tree, and n — 2 further subsets of [to], whose identities depend on 
how the coalescent evolves, which are the internal vertices of the tree. The tree T has 
to + 1 + (to — 2) = 2to — 1 vertices all together. Associate with each subset v of [to] that 
is a vertex of the tree the time t(v) at which the coalescent forms the cluster v. Thus 
t(v) = 0 if and only if v is one of the to singleton leaf vertices, f([n]) = inf{t : fflit = 1}, 
and the collection of times t(v) as v ranges over the to — 1 non-leaf vertices of the tree 
is the set of times t at which the process (#1R,t > 0) experiences a downward jump. 
For each non-leaf vertex v in T, let there be exactly two edges of T directed from v 
to v\ and V 2 , where v\ and V 2 are the two clusters which merged to form v. Let each 
vertex v of T be placed at height t(v) equal to the time of its formation, and for i = 1,2 
regard the directed edge from v to Vi as a segment of length t(v)—t(vi). Now Kingman’s 
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construction of lie amounts to supposing that there is a Poisson process of cut points 
on the edges of this tree, with rate 9/2 per unit length, and identifying the blocks of lie 
with the restrictions to the set of n leaves of T (identified with [n]) of the components 
of the random forest obtained by cutting segments of T at the Poisson cut points. Now 
conditionally given the tree T, construct the Poisson cut points simultaneously for all 
9 > 0 so that for each edge of the tree of of length i the moments of cuts of that edge 
form a homogeneous Poisson process of rate 19/ 2, and these processes are independent 
for different edges. Then lie has been constructed simultaneously for each 9 > 0 in such 
a way that lie is obviously a refinement of 11^ for 9 > <f. Since with probability one 
there are no ties between the times of cuts on different segments, it is clear that the 
process (lie ,e> 0) develops by binary splits. Thus (lie ,e> 0) is a Gibbs fragmentation 
process. □ 


While the one-dimensional distributions of this process (lie, 9 > 0) are given by 
Ewens’ sampling formula (184(1 . the two and higher dimensional distributions seem dif¬ 
ficult to describe explicitly. In particular, a calculation of the simplest transition rate 
associated with the process (lie, # > 0), provided below, shows that this rate depends 
on 9. It seems quite difficult to give a full account of all transition rates of (lie, # >0), 
though their general form can be described and a method for their computation for small 
n will be indicated. For n> 2 the process (lie,# > 0) turns out to be non-Markovian, 
so its distribution is not determined by its transition rates. 

In connection with Proposition [12] and such calculations, the following problem 
arises: 

Problem 3. Does there exist for each n a V\ n ^-valued Gibbs fragmentation process 
(n*,i < k < n) with weight sequence ((j — 1 )!,j >1)? 


7.1 Calculations with the tree derived from Kingman’s coalescent. 

The following calculations (Proposition [13]) show that for n > 4, the discrete-time chain 
embedded in (lie ,0> 0) (that is, the sequence of successive states of (lie, # > 0), or its 
discrete skeleton) does not provide a solution to Problem [3] 

Let T n denote the random tree derived as in the proof of Proposition [12] from King¬ 
man’s n-coalescent (n t ,t > 0), and recall the definition of the fragmentation process 
(lie, 9 > 0). Let 0 be the time of the first cut in this process, and let II© be the state 
of the fragmentation at this random time. Thus almost surely 11© is a partition with 
two blocks. 

Proposition 13. The law of © is determined by 


P(0 € d9)/d9 


(n~l)! yi 1 
[9 + l] n —l i=1 i + 9 


(85) 
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For it with two components of sizes n\ and ni 


and 




p(n 0 = 7T | 0 = o) = 


CTjElkW )- 1 


p(n 0 = 7r) 


(n- 1)! f°° de 

(nj Jo [O + ljn-1 


^ [(7-1 1 ) + (T-'i 1 ) 

Si ( n 7 1 )(’ + 9 ) 


( 86 ) 


(87) 


Proof. We only provide a sketch of the calculations leading to this result as they are 
somewhat tedious. For 1 < k < n let T*, = inf {t : #11* = k}. Let Si = (i + 1) (T t — Tj+i), 
which we call the i th stratum of the tree. It is clear that the total length of all segments 
in the tree T is 

71—1 

L n '■= — Ti + 1 ) ( 88 ) 

2—1 

From the definition of the underlying coalescent process (Ilf, t > 0), the random variable 
Tj_|_i — Ti has exponential distribution with rate i(i + l)/2, and these random variables 
are independent for 1 < i < n — 1. It follows that 


£'exp(--L r , 


tt _ + l)/2 _ 

1 = 1 0(i + l)/2 + i(i + l)/2 


(n — 1)! 

\Q + l]n—1 


(89) 


where [9 + l] n _i = 0^=1* + *)• On the other hand, given L n , the Poisson pro¬ 

cess with rate 9/2 per unit segment length in the tree has no points with probability 
exp(— (9/2)L n ). So the expectation calculated in (1891) is just the probability that lie 
is the partition of [n] with one component, or in other words that 0 > 9. Thus (185H 
follows by differentiation. 

Now, let I denote the index of the stratum in which the first cut point falls at time 
0. Then it follows from the representation of L n as the sum of independent exponential 
variables L n = Si that the sum over i in (1851) corresponds to summing over the 

possible values i of I. That is, for 1 < i < n — 1, 


P(© € 9,I = i)/d9 


(n — 1)! 1 

[9 + l]n—l i + 9 


(90) 


and hence 


P(I = { | © = 9) 


o i+s r 1 


(91) 


Observe now that given 0 = 0 and / = i, the partition II 0 consists of two components, 
obtained as the restriction to [n], identified as the set of leaves of the tree T, of the 
two components of T separated by the cut at time 0 in stratum i of T. To be precise, 
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IIq = {C, [n] — C} where C is the cluster of lit in existence during the time interval 
(Tj + i,Tj) corresponding to the segment of T which is cut at time 0. This C is one of 
the clusters of II* +1 , where (n£, 1 < k < n) is the discrete skeleton of (n t ,t > 0). Now 
by construction of the Poisson cutting process, and the fact that the discrete skeleton 
(nj;,l < k < n) of (U t ,t > 0) is independent of (#11*, t > 0), it is clear that the 
conditional distribution of II* +1 given 0 = 0 and I = i is identical to its unconditional 
distribution, that is the Gibbs distribution on V\ n i + \\ with weights (j\,j > 1), and that 
C is one of the i + 1 components of n* +1 picked by a mechanism independent of the 
sizes of these components. Therefore, 


P(#C = m | 0 = 0,/ = *)= P(#C i+ i = m) (92) 


where G\+i is a random component of II*, j. After some combinatorics, it follows easily 
that the conditional distribution of IIq given 0 = 9 and I = i is given by 


p(n© = 7 T | 0 = 0,i = *) 



(93) 


Combining this expression with (1911) shows that the conditional distribution of IIq given 
0 = 6 is given by ([86]) . We now easily obtain (1571) from (1861) and (185|) by integration. □ 

We now briefly explain why it can be deduced from the explicit formula (1871) that 
the discrete chain embedded in (IR, 9 > 0) is not a Gibbs fragmentation. We must 
simply show that (|87l) does not coincide with the Gibbs microcanonical distribution 
p n ,2 associated with the weight sequence Wj = (j — 1)!. Let J© denote the size of a 
component of n© picked by the toss of a fair coin independent of II©. Then using the 
above, if n\ = 1 and 712 = n — 1, 


P(J© = 1) 



( 71—1 


\ i=1 


% + 


+ 


1 + 


After a few lines of algebra, using some integration by parts and some partial fractions, 
we can conclude that 


P(J© = 1) 



n—1 


!)• a ' n ' i lo S* + o- 


i —1 


(94) 


where a* = (-1)* 1 ( i ” 1 )/n!. 

On the other hand, let II have the Gibbs (n, 2,w) distribution with Wj = (j — 1)!, 
and J is the size of a randomly picked component. Remark that by decomposing on 
the size of the cycle containing 1, 

n—1 

B n,2 = 

3 = 1 


n — 1 
3 ~ 1 


(,3 ~ l)K ra - 1 - J)- = iji - l)!I7„-i 
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where H n _\ := J2j =i 1 /]■ Since the number of permutations with exactly two cycles 
one of which has size 1 is n{n — 2)!, we conclude that P(J = 1) = \ ( n -X)H - • This 
is incompatible with (19411 . Indeed if this was to be equal to right-hand side in (1941) 
one would get log)}}//) 1 i ri ) = q for some rational number q and = (—1)*( . G Z, 
and thus e q = q' for some (other) rational numbers q and q'. This contradicts the 
transcendence of e. 

Thus the distribution of the partition of [n] into two parts obtained at the time of 
the first split is not the common distribution of Hq given jfHo = 2 for all 9 > 0. In 
particular, for n = 4 the formulas above give: 

P( Je = 1) = T(Te = 3) = 3(- log 2 + i log 3) + i 

and 

P (Je = 2 ) = 6 log 2 — 3 log 3 

whereas 

P( J = 1) = P(J = 3) = 4/11 
P(J = 2) = 3/11 

We conclude that the discrete skeleton of (llq) 0 >o, he., the discrete fragmentation chain 
embedded in it, does not give a discrete Gibbs fragmentation associated with w* = 

U - !)•• 


7.2 A reformulation with walks on the symmetric group 

In view of the combinatorial interpretation of Example O Problem [3] can be restated as: 

Problem 4. Does there exist for each n a sequence of random permutations ( ak, 1 < 
k < n) such that oq has uniform distribution on the set of permutations of [n] with k 
cycles, and for k < t the partition generated by the cycles of ai is a refinement of cr^ ? 

This problem may be partially reformulated in terms of random walks on the sym¬ 
metric group. Suppose we consider the Cayley graph G n of the symmetric group induced 
by the set of generators S = {all transpositions}, that is, we put an edge between two 
permutations a and ir if and only if a may be written as a = t • n for some transposition 
r. It is well-known that multiplying a permutation by a transposition can only result 
in a coagulation or a fragmentation in the cycle structure. More precisely, suppose 
C = (x\,... ,Xk) is a cycle of the permutation ir. If we multiply by the transposition 
r = ( Xi,Xj) then the resulting cycle structure in a is the same as that of n except that 
C breaks into (x\, ..., Xj_i, Xj, Xj+i, • • •, x^) on the one hand and (xj,..., Xj- 1 ) on the 
other hand. Conversely, suppose C = (aq, ..., Xk) and C' = (y i,..., yi) are two cycles of 
7r and we multiply it by the transposition ( Xi,yj ). In the resulting permutation, C and 
C' will be replaced by a unique cycle C" = (aq,..., Xj_i, yj, yj+i, ■ ■ ■, yj-i,Xi ,..., aq). 
In particular, any (random) walk on G n may be viewed as a coagulation and fragmen¬ 
tation process on the cycle structure of the permutation. Moreover, a well-known result 
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due to Cayley states that if a is a permutation then the graph distance between a and 
the identity permutation I (i.e., the minimum number of edges one must cross to go 
from I to a on G n ) is simply n — ^cycles of a. As a consequence, by considering a 
time-reversal of the process, to solve Problem [I] it is enough to construct a random 
process (Ofc)o<fc<n-i on G n which has the following two properties: 

1 . The sequence (ofc,0 < k < n — 1) is a random walk on G n , in the sense that if 
<7fc = a at the next stage a can only jump to one the neighbors of a. 

2. The permutation has the following marginal distribution: at stage k the dis¬ 
tribution of <7fc is uniform on the sphere of radius k about the identity, that is the 
set of all permutations whose distance to the identity is k. 

Property 1 ensures that the cycles of perform a coagulation-fragmentation process. 
In conjunction with property 2, since must be at distance k, it must be the case that 
all jumps of a are produced by some fragmentation. Moreover, by Cayley’s result for the 
distance of a permutation to the identity, if Ok is uniform on the sphere of radius k then 
its cycle structure is a realization of the Gibbs distribution © with weight sequence 
Wj = (j — 1)! (Note however this is not strictly equivalent to Problem 0] since not all 
fragmentations at the level of partitions can be represented by moving along some edge 
of G n . For instance, it is impossible to get from the permutation (1 2 3 4) to the 
permutation (1 3)(2 4) in one step). 

In this context, a very natural process to consider is the simple random walk on G n , 
conditioned to never backtrack. In other words, starting from the identity, at each step 
choose uniformly among all edges that lead from distance k to distance k + 1. Although 
this may seem a very natural candidate for properties 1 and 2, this is far from being the 
case. Much is known about this process, and in particular it has been shown in [8j that 
the distribution of this process at time k = [an\ for any 0 < a < 1 is asymptotically 
singular with respect to the uniform distribution on the sphere of radius k. 
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